AITopics | momentum network

Appendix Meta-Learning with Self-Improving Momentum Target AOverview of terminologies used in the paper

Neural Information Processing SystemsApr-25-2026, 05:11:29 GMT

The meta-learner network, i.e., learns to generalize on new tasks. Algorithm for adapting the meta-model into a task expert by using a given task dataset. Support set S. A dataset sampled from a given task distribution that is used for the adaptation. Query set Q. A dataset sampled from a given task distribution (that is disjoint with the support set) to evaluate the adaptation performance of the algorithm. Network adapted from the meta-model using the support set by using the adaptation subroutine, i.e., Adapt(,S) Momentum network moment.

artificial intelligence, experiment, machine learning, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Add feedback

29440165fee0471389ba3f80a7b3f95f-Paper-Conference.pdf

Neural Information Processing SystemsApr-25-2026, 05:11:26 GMT

artificial intelligence, machine learning, target model, (13 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (1.00)

Industry: Education (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

29440165fee0471389ba3f80a7b3f95f-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 01:05:13 GMT

accuracy, experiment, simt, (13 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.66)

Add feedback

29440165fee0471389ba3f80a7b3f95f-Paper-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 01:05:09 GMT

international conference, momentum network, target model, (11 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > California (0.04)
Asia > South Korea > Gyeongsangbuk-do > Pohang (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Education (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Meta-Learning with Self-Improving Momentum Target

Neural Information Processing SystemsDec-23-2025, 22:52:29 GMT

The idea of using a separately trained target model (or teacher) to improve the performance of the student model has been increasingly popular in various machine learning domains, and meta-learning is no exception; a recent discovery shows that utilizing task-wise target models can significantly boost the generalization performance. However, obtaining a target model for each task can be highly expensive, especially when the number of tasks for meta-learning is large. To tackle this issue, we propose a simple yet effective method, coined Self-improving Momentum Target (SiMT). SiMT generates the target model by adapting from the temporal ensemble of the meta-learner, i.e., the momentum network. This momentum network and its task-specific adaptations enjoy a favorable generalization performance, enabling self-improving of the meta-learner through knowledge distillation. Moreover, we found that perturbing parameters of the meta-learner, e.g., dropout, further stabilize this self-improving process by preventing fast convergence of the distillation loss during meta-training. Our experimental results demonstrate that SiMT brings a significant performance gain when combined with a wide range of meta-learning methods under various applications, including few-shot regression, few-shot classification, and meta-reinforcement learning.

meta-learning, self-improving momentum target, target model, (6 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.97)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Meta-Learning with Self-Improving Momentum Target

Neural Information Processing SystemsOct-10-2024, 10:54:46 GMT

The idea of using a separately trained target model (or teacher) to improve the performance of the student model has been increasingly popular in various machine learning domains, and meta-learning is no exception; a recent discovery shows that utilizing task-wise target models can significantly boost the generalization performance. However, obtaining a target model for each task can be highly expensive, especially when the number of tasks for meta-learning is large. To tackle this issue, we propose a simple yet effective method, coined Self-improving Momentum Target (SiMT). SiMT generates the target model by adapting from the temporal ensemble of the meta-learner, i.e., the momentum network. This momentum network and its task-specific adaptations enjoy a favorable generalization performance, enabling self-improving of the meta-learner through knowledge distillation.

meta-learning, self-improving momentum target, target model, (3 more...)

Neural Information Processing Systems

Genre: Play > Prospect (0.60)

Industry: Education (0.42)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Self-trained Panoptic Segmentation

Verma, Shourya

arXiv.org Artificial IntelligenceNov-17-2023

Panoptic segmentation is an important computer vision task which combines semantic and instance segmentation. It plays a crucial role in domains of medical image analysis, self-driving vehicles, and robotics by providing a comprehensive understanding of visual environments. Traditionally, deep learning panoptic segmentation models have relied on dense and accurately annotated training data, which is expensive and time consuming to obtain. Recent advancements in self-supervised learning approaches have shown great potential in leveraging synthetic and unlabelled data to generate pseudo-labels using self-training to improve the performance of instance and semantic segmentation models. The three available methods for self-supervised panoptic segmentation use proposal-based transformer architectures which are computationally expensive, complicated and engineered for specific tasks. The aim of this work is to develop a framework to perform embedding-based self-supervised panoptic segmentation using self-training in a synthetic-to-real domain adaptation problem setting.

dataset, segmentation, semantic mask, (14 more...)

arXiv.org Artificial Intelligence

2311.10648

Country:

North America > United States (0.14)
North America > Canada > Quebec > Capitale-Nationale Region > Québec (0.04)
North America > Canada > Quebec > Capitale-Nationale Region > Quebec City (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Ground > Road (0.93)
Health & Medicine > Diagnostic Medicine > Imaging (0.66)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Unsupervised Meta-Learning via Few-shot Pseudo-supervised Contrastive Learning

Jang, Huiwon, Lee, Hankook, Shin, Jinwoo

arXiv.org Artificial IntelligenceMar-2-2023

Unsupervised meta-learning aims to learn generalizable knowledge across a distribution of tasks constructed from unlabeled data. Here, the main challenge is how to construct diverse tasks for meta-learning without label information; recent works have proposed to create, e.g., pseudo-labeling via pretrained representations or creating synthetic samples via generative models. However, such a task construction strategy is fundamentally limited due to heavy reliance on the immutable pseudo-labels during meta-learning and the quality of the representations or the generated samples. To overcome the limitations, we propose a simple yet effective unsupervised meta-learning framework, coined Pseudo-supervised Contrast (PsCo), for few-shot classification. We are inspired by the recent self-supervised learning literature; PsCo utilizes a momentum network and a queue of previous batches to improve pseudo-labeling and construct diverse tasks in a progressive manner. Our extensive experiments demonstrate that PsCo outperforms existing unsupervised meta-learning methods under various in-domain and cross-domain few-shot classification benchmarks. We also validate that PsCo is easily scalable to a large-scale benchmark, while recent prior-art meta-schemes are not. Learning to learn (Thrun & Pratt, 1998), also known as meta-learning, aims to learn general knowledge about how to solve unseen, yet relevant tasks from prior experiences solving diverse tasks. In recent years, the concept of meta-learning has found various applications, e.g., few-shot classification (Snell et al., 2017; Finn et al., 2017), reinforcement learning (Duan et al., 2017; Houthooft et al., 2018; Alet et al., 2020), hyperparameter optimization (Franceschi et al., 2018), and so on.

artificial intelligence, benchmark, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2303.00996

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > Austria (0.04)
(14 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Education (0.48)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

State of Meta Learning in 2022 part2(Artificial Intelligence)

#artificialintelligenceOct-16-2022, 19:36:31 GMT

Abstract: Meta-Learning has emerged as a research direction to better transfer knowledge from related tasks to unseen but related tasks. However, Meta-Learning requires many training tasks to learn representations that transfer well to unseen tasks; otherwise, it leads to overfitting, and the performance degenerates to worse than Multi-task Learning. We show that a state-of-the-art data augmentation method worsens this problem of overfitting when the task diversity is low. We propose a simple method, TaskMix, which synthesizes new tasks by linearly interpolating existing tasks. We compare TaskMix against many baselines on an in-house multilingual intent classification dataset of N-Best ASR hypotheses derived from real-life human-machine telephony utterances and two datasets derived from MTOP.

artificial intelligence, meta-learning, target model, (14 more...)

#artificialintelligence

Genre: Research Report > New Finding (0.32)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Meta-Learning with Self-Improving Momentum Target

Tack, Jihoon, Park, Jongjin, Lee, Hankook, Lee, Jaeho, Shin, Jinwoo

arXiv.org Artificial IntelligenceOct-11-2022

The idea of using a separately trained target model (or teacher) to improve the performance of the student model has been increasingly popular in various machine learning domains, and meta-learning is no exception; a recent discovery shows that utilizing task-wise target models can significantly boost the generalization performance. However, obtaining a target model for each task can be highly expensive, especially when the number of tasks for meta-learning is large. To tackle this issue, we propose a simple yet effective method, coined Self-improving Momentum Target (SiMT). SiMT generates the target model by adapting from the temporal ensemble of the meta-learner, i.e., the momentum network. This momentum network and its task-specific adaptations enjoy a favorable generalization performance, enabling self-improving of the meta-learner through knowledge distillation. Moreover, we found that perturbing parameters of the meta-learner, e.g., dropout, further stabilize this self-improving process by preventing fast convergence of the distillation loss during meta-training. Our experimental results demonstrate that SiMT brings a significant performance gain when combined with a wide range of meta-learning methods under various applications, including few-shot regression, few-shot classification, and meta-reinforcement learning.

artificial intelligence, machine learning, target model, (15 more...)

arXiv.org Artificial Intelligence

2210.05185

Country: